Centralized Congestion Control and Scheduling in a Datacenter

نویسندگان

  • Devavrat Shah
  • Qiaomin Xie
چکیده

We consider the problem of designing a packet-level congestion control and scheduling policy for datacenter networks. Current datacenter networks primarily inherit the principles that went into the design of Internet, where congestion control and scheduling are distributed. While distributed architecture provides robustness, it suffers in terms of performance. Unlike Internet, data center is fundamentally a"controlled"environment. This raises the possibility of designing a centralized architecture to achieve better performance. Recent solutions such as Fastpass and Flowtune have provided the proof of this concept. This raises the question: what is theoretically optimal performance achievable in a data center? We propose a centralized policy that guarantees a per-flow end-to-end flow delay bound of $O$(#hops $\times$ flow-size $/$ gap-to-capacity). Effectively such an end-to-end delay will be experienced by flows even if we removed congestion control and scheduling constraints as the resulting queueing networks can be viewed as the classical reversible multi-class queuing network, which has a product-form stationary distribution. In the language of Harrison et al., we establish that baseline performance for this model class is achievable. Indeed, as the key contribution of this work, we propose a method to emulate such a reversible queuing network while satisfying congestion control and scheduling constraints. Precisely, our policy is an emulation of Store-and-Forward (SFA) congestion control in conjunction with Last-Come-First-Serve Preemptive-Resume (LCFS-PR) scheduling policy.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Datacenter Congestion Control: Identifying what is essential and making it practical

In its never-ending quest to improve datacenter performance, the research community has produced a continuing stream of papers on datacenter congestion control. These proposals differ along many dimensions, including how congestion is detected and signaled, how end hosts adapt to congestion, and how packets are scheduled by the switches. The result is a cornucopia of datacenter congestion contr...

متن کامل

A mechanism achieving low latency for wireless datacenter applications

Recently, several wireless/optical datacenter architectures are designed to overcome the drawbacks of wired datacenter topologies, such as expensive highend switches, high cabling complexity, congestion caused by a few hot nodes. Compared with wired switches, current commodity wireless switches usually suffer lower throughput as well as higher packet loss ratio and latency. However, todays data...

متن کامل

Flowtune: Flowlet Control for Datacenter Networks

Rapid convergence to a desired allocation of network resources to endpoint traffic is a difficult problem. The reason is that congestion control decisions are distributed across the endpoints, which vary their offered load in response to changes in application demand and network feedback on a packet-by-packet basis. We propose a different approach for datacenter networks, flowlet control, in wh...

متن کامل

Optimizing Job Reliability Through Contention-Free, Distributed Checkpoint Scheduling

A datacenter that consists of hundreds or thousands of servers can provide virtualized environments to a large number of cloud applications and jobs that value the requirement of reliability very differently. Checkpointing a virtual machine (VM) is a proven technique to improve reliability. However, existing checkpoint scheduling techniques for enhancing reliability of distributed systems have ...

متن کامل

TCP congestion control for wireless datacenters

TCP is a low cost and easy-to-use transport layer protocol widely used in datacenter based applications and web services. Many TCP congestion control algorithms have been proposed to improve the performance of TCP in datacenter networks. However, the emerging wireless technologies in datacenter networks create new problems for TCP congestion control. On the one hand, TCP algorithms must suit th...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • CoRR

دوره abs/1710.02548  شماره 

صفحات  -

تاریخ انتشار 2017